Marker-assisted Selection in 
Backcross Breeding 

SJ. Openshaw 

Pioneer HUBred Ml Inc., P a Box 1004, Johnston, IA 50131 
S.G.Jarboe 1 

CIMMYT, Lisboa 27, Apdo. Postal 6-641, 06600 Mexico, D.R, Mexico 
W.D. Beavis 

Pioneer HUBred IntL Inc., R O. Box 1004, Johnston, IA 50131 

Abstract The backcross breeding procedure has been osed widely to transfer simply inherited traits into elite genotypes. 
Genetic markers can increase the effectiveness of backcrossing by 1) increasing the probability of obtaining a suitable 
conversion, and 2) decreasing the tirnc required to achieve an acceptable recovery. Simulation and field results indicated 
that, for a genome consisting of ten 200-cM chromosomes, basing selection on 40 or 80 markers in 50 BC individuals that 
carry the allele being transferred can reduce the number of backcross generations needed from about seven to three. 

The backcross breeding procedure has been used widely 
to transfer simply inherited traits into elite genotypes. 
Usually, the tralc being transferred is controlled by a 
single gene, buthighly heritable traits that arc more complexly 
inherited have also been transferred successfully by baeliross- 
ing; for example, maiurity in maize (Rlnke and Sentz, 1961; 
Shavet, 1976). Today, backcrossing is being used to transfer 
genes introduced by such techniques as transformatioa or 
mutation into appropriate germplasm. 

Several plant breeding textbooks give good descriptions of 
the backcross procedure (Allard, 1960; Fehf, 1987). A donor 
parent (DP) carrying a trait of interest is crossed to the recurrent 
parent <RP), an elite line that Is lacking the U"ak. The F, is 
crossed back to the RP to produce the BC ( generation. In the 
BC, and subsequent backcross generations, selected individu- 
als carrying the gene beii\g transferred are backcrossed to the 
BP, The expected proportion of DP genome is reduced by half 
wjtb each generation of backcrossing. Ignoring effects of link- 
age to the selected DP allele being transferred* the percentage 
recurrent parent (%RP) genome expected in each backcross 
generation is calculated as: 



where n is the number of backcrossea. 

Backcrossing of selected plants to the RP can be repeated 
each cycle until a line is obtained thai as essentially a version of 
the RP that includes the infrogressed allele. After six back- 
crosses, the expected recovery is >99% (Table 1). 

Until recently t discussions of the recovery of the RP genome 
during backcrossing have emphasized the expected values for 
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%PP shown in Table I, and have largely ignored the genetic 
variation for %RP that exists around the expected mean. With 
the development of genetic markers capable of providing good 
genome coverage, there has been interest in taking advantage of 
that variation to increase the efficiency of backcrossing. 

Selection for RP marker alleles can increase greatly the 
effectiveness of backcross programs by allowing the breeder to * 
1) select backcross plants that have a higher proportion of RP 
genome, and 1) select backcross Individuals that hre better 
conversions near a mapped donor allele being transferred (i.e., 
select for Jess linkage drag). Expressed in practical terms, using 
generic markers to assist backcrossing can 1) increase the 
probability of obtaining a suitable conversion, and 2) decrease 
the lime required to achieve an acceptable recovery. 

Issues to consider when planning a marker-assisted back- 
cross program Include I) the dme advantage of using markers 
to assist backcrossing, 2) the number of markers needed, and 3) 
the number of genotypes to evaluate. In this report, we use 
results from previous literature, computer simulation, and em- 
pirical studios to provide some guidelines. 



T*W« 1, Expected recovery of recurrent parent (RP) genome during 
txidccrtSsiAg, assuming linkage to the gene being xrcKtJerred> 
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Materials and methods 

The maize genome was the model for the simulation. The 
, simulated genome confined ten 200-cM chromosomes. Simu- 
lation of crossing over was based ort a Poisson distribution with 
a mean of 2.0 (X = 2) (Hanson, 1959), which, on average, 
generated one cross over for every 100-cM length. The Simula- 
dofls reported here assume no interference. Codominant jje- 
' fistic markers were evenly distributed in the genome and sites 
of the donor gene were randomly assigned to genome locations. 
' Simulations were conducted with the following parameters; 

Number of progeny: 100 or 500. 

Backcross generations: BC p BC r and BC r 

Number of markers: 20, 40, 80, or 100. 

Number selected to form the nextBC generation: 1 or 5. 

Selection was baaed on 1) presence of the donor allele and 2) 
high %RP). %RP was calculated as the average of the (one or 
five) selected individuals. Values presented are the mean of 50 
simulations. 

.Results 

In the computer simulation study, all methods modeled 
greatly increased the speed of recovering the RP genome 
, compared to the expected recovery with no marker-assisted 
• selection (compare Tables 1 and 2). A: least 80 markers were 
. required to recover 99% of the RP genome in just three BC 
generations (Table 2). Use of at least 80 markers and 500 
progeny allowed recovery of 9S% RP in Just two BC genera- 
tions. Response to selection was diminished only slightly by 
spreading the effort over five selections. Using markers, the 
number of backcross generations needed to convert an Inbred is 



reduced from about seven to three. 

By the BC, generation, there appears to be no practical 
advantage to using S00 vs. 100 individuals. If the presence of 
the donor trait in the backcross individuals can be ascertained 
before markers are genotyped, then only half the number of 
individuals indicated in the tables will need to be analyzed. 

When a small number f markers arc used, they quickly 
became non-informative; i.e., Selection causes the marker loei 
to became fixed for the RP type before the rest of the genome 
is fully con verted (Tabic 3; Hospital et ah, 1992). This sicuadon 
was most prominent in the largBr populations, where s higher 
selection intensity placed more selection pressure upon the 
marker loci. Accordingly, it is of interest to consider how 
closely the estimation of %W? based on markers reflects the 
actual genome composition. The combination of estimation of 
9bRP based on fewer markers and subsequent a election tends to 
bias the estimates upwud (compare Tables 2 and 3), 

The results from the simulation compare well with real field 
data. InatypicalcxampIc^OBCjpIantscarryingthe gene being 
transferred were genotyped at S3 polymorphic RFLP loci (note 
that this corresponds to a population size of 100 unselected 
plants in Tables 2 and 3). The five best BC, recoveries had 
estimated %RP values of 85.9%, 82.7%, 82-0%, 81,4%, and 
81.2%. After evaluating lOBCjplants from each selectedBC,, 
the best BCj recovery had an estimated %RP of 94.6%. 

Discussion 

The simulations (Tabic 2; Hospital et ai. t 1992) and our 
experience indicate that four markers per 200-cM chromosome 
is adequate to gready increase the effectiveness of selection in 
the BC,. However, using cnly four markers per 200 cM will 
likely make it very difficult to map the location of the gene of 
interest. Adequate summarization of the data is an important 
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; pari of a marker-assisted backcross program. Ideally, the mark- 
t& used can supply data that can be represented as alleles of loci 
' vrfth known map position. Estimation of %RX\ mapping the 
position of the locus of interest, and graphical display of the 
jesulis (Young and Tanfcslcy, 1989) are all useful in under- 
paying and controlling the specific backcross experiment 
•being conducted. 

It appears that, with the use of genetic markers, the portion 
. of the 5P genome that Is not linked to the allele being trans- 
.{erred can be recovered quickly and with confidence. The 
recovery of RP will be slower on the chromosome carrying the 
' .gene of interest A considerable amount of linkage drag is 
* expected to accompany selection for the DP allele in a back- 
, .cross program. For a locus located in the middle of a 200-cM 
chromosome, the length of the DP chromosome segment ac- 
, companying selection is expected to be 126, 63, and 28 cM in 
' the BC , BC 3 , and BC T generations, respectively (Hanson, 
' J 959; Naveira and Barbadilla, 1992). Our observations support 
the recommendation of Hospital ec al. ( 1992) that preference be 
/given to the selection for recombinants proximal to the allele of 
/.interest, but that selection for recovery of the RP elsewhere In 
the genome also be considered This two-stage selection can 
probably be done quite effectively ad hoc by the breeder once 
; the data is adequately summarized; however, Hospital ct al. 



suggest ways to Incorporate the two criteria Into a selection 
index such that each component of selection Is assured appro- 
priate weighting. 

Useof genetic markers can greatly increase the effectiveness 
of backcrassing, and they should be used in any serious back- 
crossing program if resources are available to the breeder. 
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